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Abstract 


A brief description of supercomputers is presented along with a definition and 
illustration of vectorization. To demonstrate the usefulness of the^»e super- 
computers in solving tribological problems, a simple kernel of the fll'm shape 
calculations in an elastohydrodyt^amic lubricated rectangular contact is pre- 
sented, The relevant equations are briefly described. Both scalar and vector 
versions of the film shape code are presented. The run times of the two types 
of code indicate that over a 50-to-l speedup of scalar to vector computational 
time for vector lengths typically used in elastohydrodynamic lubrication anal- 
ysis is obtained. 


Introduction 

The present day powerful computing machines known as supercomputers have peak 
computing speeds exceeding 100 million operative per second. This is to be 
contrasted with the first commerical electronic computer delivered in 1951 
which had a peak computing speed of around 600 operations per second. This 
means that the speed of large scale scientific computers has doubled on the 
average every 2 years. Although the current performance levels of such 
machines owe much to the rapid advance of microelectronics, new concepts in 
computer architecture have been equally important. The term "architecture" 
refers to the logical organization of the computer as it is seen by the pro- 
grammer. The recent architectural innovations of greatest significance are 
those that enable the machine to carry out many similar operations in par- 
allel, or concurrently. Thi's is referred to as vector computation. The 
latest supercomputers allow the programmer to specify that many different ele- 
mentary steps be executed simultaneously, whereas earlier computers obliged 
the programmer to break his computational program down into a sequence of ele- 
mentary steps that would be executed one at a time. This is referred to as 
scalar computation. The speed with which signals can be propagated from one 
part of the computer to another is limited by the speed of light, which is 
close to 0.3 m per nanosecond. To get any further improvement in computation 
speed meant that innovative approaches to the basic architecture of the com- 
puter were required. These circumstances brought about the development of the 
latest supercomputers which utilize vector operations to achieve faster oper- 
ational speeds. Levine (1982) gives a thorough and useful description of 
supercomputers. 

The latest supercomputers represent a quantum jump in the speed of operation 
of large scientific computers, and their effect in solving tribological prob- 
lems of the next decade should be significant. The present paper will attempt 
to demonstrate the effect of using these computers in performing film shape 
calculations in solving elastohydrodynamic lubrication problems. Comparisons 


will be made between scalar and vector operations to demonstrate the utility 
of new computers. The film shape calculations were chosen since in elastohy- 
drodynamic lubrication calculations over 80 percent of the computation time Is 
spent in the film shape calculations. For simplistic considerations a rec- 
tangular contract (sometimes referred to as a line contact) Is studied rather 
than an elliptical contact. 

Film Shape Formulation 

Only the film shape calculations of the elastohydrodynamic lubrication analy- 
sis will be considered in this paper. The film shape for a rectangular con- 
tact can be written as; 


h » hjj + S(x) + a(x) 


( 1 ) 


where { 

hg - constant 

S(x) - separation due to geometry of undeformed solids 
6(x) - elastic deformation 

The separation due to the geometry of the underformed rollers shown in figure 
1(a) can be described by an equivalent cylindrical solid near a plane as shown 
In figure 1(b). The geometrical requirement is that the separation of the two 
rollers In the initial and equivalent situations should be the same at equal 
values of x. Therefore the separation due to the underformed geometry of the 
two rollers can be written as; 


S(x) 
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where: 




(3) 


Equation (2) makes use of the well-known parabolic approximation to the circu- 
lar section of the solid and is valid as long as the separation Is much small- 
er than the radius of/durvature. 

From Timoshenko and Goodler (1951), the elastic deformation at a point of a 
serai-infinite solid subjected to a pressure p along the line x » xi, can be 
written as; . 

5(x)«--^/ p ln(x - x^^ dX;^ (4) 
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where: 


E * ■ W" ( S ) 

1 - 1-7 
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V « Poisson's ratio 
E - modulus of elasticity 

Figure 2 shows a rectangular area of uniform pressure. If the pressure is 
assumed to be uniform over the rectangular area the pressure can be put in 
front of the integral in equation (4). Therefore the integration can be per- 
formed, resulting in the following: 

6(x) » I P D (6) 


where: 

D = b[(X - B)ln(x - B)2 - (X + B)ln(X + B)^ + 4B(1 - In b)] (7) 


b e semi width of Hertzian contact 
X » x/b, dimensionless coordinate 
B « 15/b a 

n = number of nodes within the semiwidth of the contact 

In equation (6) the elastic deformation is found to be equal to a constant 
multiplied by the dimensionless pressure multiplied by a distance influence 
factor. Now the term 6(x) in equation (6) represents the elastic deformation 
at a point A due to a rectangular area of uniform pressure and width 2 d. If 
the conjunction is divided into a number of equal rectangular areas, the total 
deformation at a point A due to the contribution of the various rectangular 
areas of uniform pressure in the conjunction can be evaluated numerically. 

The total elastic deformation caused by the rectangular areas of uniform pres- 
sure within a conjunction can be, written as: 

isl^E, . . . 


where: 

j = |k - i j +1 (10) 
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Therefore substituting equations (2) and (9) Into equation (1) while writing 
the film shape In dimensionless form givesj 
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Scalar and Vector Programs 


Having formulated the film shape, the next task was to program it both scalar- 
ly and vectoriallyi Before proceeding it might be well to point out more spe- 
cifically what is meant by scalar and vector coding. Scalar coding is conven- 
tional coding that has been in existence since the first commerically avail- 
able computer in 1951. Vectorization, or the process of writing programs in 
vector code, consists of designing, organizing, and writing programs so the 
maximum possible number of arithmetic and logical operations are executed as 
vector instructions, Vectorization is most easily introduced and illustrated 
by example. Consider arrays A and B, each consisting of 100 numbers. Assume 
that, one wishes to compute array C, where C-j » Ai + Bj, i » 1,...,100. The 
traditional "scalar" computer executes five assembly language instructions 100 


times. There are two memory fetches (A^ and B-j), one addition, one store to 
memory (for C^), and an instruction that increments a counter, tests, and 
branches back to load the next pair of input operands. Thus, 500 scalar in- 


structions are executed to add arrays A and B. A vector computer *s compiler 
can generate "vector" object code, which executes very differently. The vec- 
tor code for adding the 100 pairs of operands consists of a single hardware 
instruction. The vector instruction executes by continuously streaming oper- 
ands from central memory into the central processor, where the addition takes 
place, and continuously streaming answers back to memory. During execution of 
the vectorized addition, some elements of A and B are being read from central 
memory, some elements of A and B are undergoing addition, and some are being 
written in memory. 


Figures 3 and 4 show the computer code for the film shape of the scalar and 
vector kernels, respectively. The only calculations given in these figures 
and not fc^mulated in the earlier portion of the paper are the calculations 
of 9 where: 


- » (1 - 


(IH) 


and: 

G B aE', dimensionless materials parameter 
a » pressure viscosity coefficient of lubricant 

In the vector film shape code (figure 4) the expressions are written in vector 
form throughout and the IF Statement in figure 3 is completely eliminated. 
Vectorization requires the elimination of IF, CALL, and other interrupts with- 
in DO loops. 
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Results 


Table I shows the scalar and vector computation times expressed in nanoseconds 
for varying vector length. A considerable decrease in computation time is 
seen to exist for the vector computation over that of the scalar computa- 
tion. The last column of the table gives the ratio of scalar to vector com- 
putation times. This column indicates that ^s the vector length increase:-; the 
scalar to vector time ratio increases significantly until the vector length 
becomes too large and an asymptotic condition is approached. The vector 
length used by Hamrock and Jacobson (X982) in evaluating elastohydrodynamic 
lubrication of rectangular contacts was 660. For this example, the table in- 
dicates that over a 50-to-l speedup in the film shape calculations can be ob- 
tained on supercomputers that utilize vecto*' computation, Figure 5 shows the 
results presented in Table I in graphical form. Here the leveling off of the 
scalar to vector computation times as the vector length becomes very large is 
quite apparent. In a typical elastohydrodynamic lubrication run the film 
shape code shown in figure 4 is used 2000 times. 

Conclusions 


A brief description of supercomputers' capabilities has been briefly presen- 
ted, To demonstrate their usefulness in solving tribological problems, a sim- 
ple kernel of the film shape calculations in an elastohydrodynamically lubri- 
cated rectangular contact was presented. Both scalar and vector versions of 
the film shape code were presented. The run times for the two types of code 
indicate that over a 50-to-l speedup of scalar to vector computation time is 
obtained for vector lengths typically used in elastohydrodynamic lubrication 
analysis. The significarice of these results should prove useful in consider- 
ing additional aspects in elastohydrodynamic lubrication such as: 

(1) Thermal effects 

(2) Surface roughness effects 

(3) Non-Newtonian effects of the fluid 

(4) Side leakage effects 

The use of supercomputers in this decade for the incorporation of these 
coupled effects should reduce run times considerably and thereby make such 
calculations more feasible. 
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TABLE I. - COMPARISON OF SCALAR AND VECTOR COMPUTER RUN TIMES 
FOR DIFFERENT VECTOR LENGTHS 


Scalar time, 
nsec 

Vector tlmo, 
nsec 

241 

70 

1 17V 

165 

2 726 

243 

9 252 

483 

73 357 

1 933 

198 282 

4 188 

383 997 

7 242 

776 588 

13 363 

6 890 354 

99 988 

19 084 530 

266 602 

37 358 446 

513 218 

76 171 860 

1 033 169 


Scalar time 


Figure 1, - Rollers and equivalent roller. 


Figure 2 . - Surface deformation of semi-infinite body subjected to 
uniform pressure over a rectangular area. 










D08I-1.NX 
SUM -.000 


DO 10 II -1, NX 
M-IABS (HI)+ 1 
10 SUM- SUM + PR(II)<-D(M) 

W(i: * 2.0DOOSUM/PI 
H(I) - H01+RX-“(S(I)+W(I)) 

IF (H(I).GT.HMIN) GO TO 400 
HMIN - H(I) 

NSAVE-I 
400 CONTINUE 

8 PHUI) - (1.0D0-DEXP(-G<’PR(II))<-(H(I)wi.50D0)/G 


Figure 3. - Scalar film shape code. 


GI- 1.0/G 

DO 500 1. NX 
DD(I) -d'(NX+M) 

500 CONTINUE 

DO 510 1-l, NX 
DD(I+NX-1) - D(I) 

W(I) - PR(1)^DD(I+NX-1) 

510 CONTINUE 

DO 530 K - 2, NX 
DO 520 1 - 1, NX 
WII) - Wdl+PRIKloDDd+NX-K) 

520 CONTINUE 
530 CONTINUE 

DO 540 1 - 1, NX 
mi) - (2.0/PlKWd) 

Hd) - HOl+RX«(Sd)+W(I)) 

PHld) - (1.0-EXP(-G’!'PRd)))«-(Hd)^SQRT(Hd)))*GI 
540 CONTINUE 

NSAVE-Q8SMIN1(H(1;NX))+1 
HMIN - HINSAVE) 


Figure 4. - Vector film shape code. 


SCALAR TIME/VECTOR TIME 



VECTOR LENGTH 


Figure 5. - Effect of scalar run time divided by vector run time on various vector 
lengths. 
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